Efficient Synchronization for Multithreaded Processors

Authors

  • Jeffrey P. Bradford
  • Seth Abraham
Abstract

In this paper we describe a source of synchronization overhead unique to multithreaded processors, and present a novel general-purpose synchronization scheme that addresses this overhead. Unlike previous schemes, we reduce the cost associated with synchronization by eliminating the need for busy-waiting or for software intervention while blocking. This is achieved via a combination of multithreading and semaphore operations implemented in hardware. Once this synchronization mechanism is in place, we propose using it to simplify the customary implementation of interrupts and exceptions. Lastly, we provide simulation results from four programs from the SPLASH-2 benchmark suite, showing performance increases between 2% and 150%.

[...] cycle multithreading. Eliminating this source of synchronization by allowing synchronization to be used in places where its overhead previously precluded its use. We give an example in Section 2, after we introduce our proposed mechanism and compare it to previous implementations. Additionally, it provides a performance advantage for existing code. In [WOT95], the authors measured the percent of the time programs in the SPLASH-2 benchmark suite spent in synchronization to range up to 40% of the execution time (although some of this idle time could be addressed by having multiple computation threads per processor). Thus, in Section 3 we present simulation results showing the performance advantage of our scheme. Section 4 is the conclusion.

Similar resources

Towards Extremely Fast Context Switching in a Block-Multithreaded Processor

Multithreaded processors use a fast context switch to bridge latencies caused by memory accesses or by synchronization operations. In the block-multithreaded processor – called Rhamma – load/store, synchronization and execution operations of different threads of control are executed simultaneously by appropriate functional units. A fast context switch is performed, whenever a functional unit co...

Running Parallel Applications on an MP with Multithreaded Superscalar Processors

With lesser returns on adding more complexity to conventional superscalar processors, simultaneous multithreaded (SMT) superscalar processors seem to be a promising alternative. Unfortunately, most previous work has focused on systems running multiprogrammed loads of sequential applications. It is not clear how well these processors work in a shared-memory multiprocessor environment running par...

Timed Colored Petri Net Models of Distributed Memory Multithreaded Multiprocessors

Distributed-memory multithreaded multiprocessors are composed of a number of (multithreaded) processors, each with its memory, and an interconnecting network. The long memory latencies and synchronization delays are tolerated by context switching, i.e., by suspending the current thread and switching the processor to another 'ready' thread provided such a thread is available. Because of very sim...

Confidence Based Out-of-Order Renaming for Speculatively Multithreaded Processors

Speculatively multithreaded processors find parallelism by speculatively fetching and renaming dynamic flows of instructions from (perhaps) widely separated parts of the program flow graph. These processors must handle inter-thread register dependences. The approach followed in this paper is to dynamically identify the consumers of interflow register mappings that will be (but have not yet been...

A New Relaxed Memory Consistency Model for Shared-Memory Multiprocessors with Parallel-Multithreaded Processing Elements

The release consistency model is the generally accepted hardware-centric relaxed memory consistency model because of its performance and implementation complexity. By extending the release consistency model, in this paper, we propose a hardware-centric memory consistency model particularly for shared-memory multiprocessor systems with parallel-multithreaded processing elements. The new model us...

Publication date: 1998